Data Dependent Peak Model Based Spectrum Deconvolution for Analysis of High Resolution LC-MS Data

نویسندگان

  • Xiaoli Wei
  • Xue Shi
  • Seongho Kim
  • Jeffrey S. Patrick
  • Joe Binkley
  • Maiying Kong
  • Craig McClain
  • Xiang Zhang
چکیده

A data dependent peak model (DDPM) based spectrum deconvolution method was developed for analysis of high resolution LC-MS data. To construct the selected ion chromatogram (XIC), a clustering method, the density based spatial clustering of applications with noise (DBSCAN), is applied to all m/z values of an LC-MS data set to group the m/z values into each XIC. The DBSCAN constructs XICs without the need for a user defined m/z variation window. After the XIC construction, the peaks of molecular ions in each XIC are detected using both the first and the second derivative tests, followed by an optimized chromatographic peak model selection method for peak deconvolution. A total of six chromatographic peak models are considered, including Gaussian, log-normal, Poisson, gamma, exponentially modified Gaussian, and hybrid of exponential and Gaussian models. The abundant nonoverlapping peaks are chosen to find the optimal peak models that are both data- and retention-time-dependent. Analysis of 18 spiked-in LC-MS data demonstrates that the proposed DDPM spectrum deconvolution method outperforms the traditional method. On average, the DDPM approach not only detected 58 more chromatographic peaks from each of the testing LC-MS data but also improved the retention time and peak area 3% and 6%, respectively.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

DeMix Workflow for Efficient Identification of Cofragmented Peptides in High Resolution Data-dependent Tandem Mass Spectrometry*

Based on conventional data-dependent acquisition strategy of shotgun proteomics, we present a new workflow DeMix, which significantly increases the efficiency of peptide identification for in-depth shotgun analysis of complex proteomes. Capitalizing on the high resolution and mass accuracy of Orbitrap-based tandem mass spectrometry, we developed a simple deconvolution method of "cloning" chimer...

متن کامل

Data pre-processing in liquid chromatography-mass spectrometry-based proteomics

MOTIVATION In a liquid chromatography-mass spectrometry (LC-MS)-based expressional proteomics, multiple samples from different groups are analyzed in parallel. It is necessary to develop a data mining system to perform peak quantification, peak alignment and data quality assurance. RESULTS We have developed an algorithm for spectrum deconvolution. A two-step alignment algorithm is proposed fo...

متن کامل

Data preprocessing method for liquid chromatography-mass spectrometry based metabolomics.

A set of data preprocessing algorithms for peak detection and peak list alignment are reported for analysis of liquid chromatography-mass spectrometry (LC-MS)-based metabolomics data. For spectrum deconvolution, peak picking is achieved at the selected ion chromatogram (XIC) level. To estimate and remove the noise in XICs, each XIC is first segmented into several peak groups based on the contin...

متن کامل

Measuring Drug-to-Antibody Ratio (DAR) for Antibody-Drug Conjugates (ADCs) with UHPLC/Q-TOF

In this paper, we are investigating an antibody drug conjugate (ADC) with a cysteine linker. The Agilent 1290 Infinity II liquid chromatography system connected to a PLRP-S reversed phase liquid chromatography column was used to separate restored light and heavy chains and light and heavy chains of the corresponding drug linkers. Each chromatographic peak was verified through the Agilent 6530 H...

متن کامل

GlyQ-IQ: Glycomics Quintavariate-Informed Quantification with High-Performance Computing and GlycoGrid 4D Visualization

Glycomics quintavariate-informed quantification (GlyQ-IQ) is a biologically guided glycomics analysis tool for identifying N-glycans in liquid chromatography-mass spectrometry (LC-MS) data. Glycomics LC-MS data sets have convoluted extracted ion chromatograms that are challenging to deconvolve with existing software tools. LC deconvolution into constituent pieces is critical in glycomics data s...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره 86  شماره 

صفحات  -

تاریخ انتشار 2014